IMAGE PROCESSING DEVICE AND PRINTER 
HAVING THE SAME 

BACKGROUND OF THE INVENTION 

1 . Field of the Invention 

The present invention relates to an image processing 
device and a printer having the same. More particularly, the 
present invention relates to an image processing device 
capable of outputting speech data together with image data, 
and a printer having the same . 

2 . Description Related to the Prior Art 

There are various types of printers according to digital 
techniques. In a printer, a frame recorded in an exposed 
photo film is printed. Also, image data is obtained by use 
of a digital camera, and is input to the printer for 
printing. Furthermore, a user may send the image data to a 
photo finishing agent by use of the Internet, so as to 
produce a print in a digital printing system. This is 
advantageous because the user does not need to come to the 
photo finishing agent directly. 

Among various types of printing service, there is 
suggestion of processing of acoustic information associated 
with the image data. For example, U.S. P. No. 5,546,145 
(corresponding to JP-A 8-076288) and U.S. P. No. 5,692,225 
(corresponding to JP-A 8-082856) disclose writing of speech 
data. In a digital printing system, the speech data 
associated with the image data is retrieved from a memory 
card or other recording medium, is converted into a bar code, 
so the bar code is printed together with an image. In 
playing back the speech data, the bar code is read from the 



print together with a bar code reading pen or a reader of a 
suitable type. 

However, there is a shortcoming in the above playback of 
the speech data, because the speech is original itself as 
5 recorded by the user, is difficult to understand aurally. 
Some nervous user is likely to hesitate to record his or her 
original voice, and may be reluctant for any unknown person 
to hear the original voice. Furthermore, a user may hesitate 
to send his or her original voice by the Internet to a photo 
10 finishing agent together with the image data. 

SUMMARY OF THE INVENTION 
In view of the foregoing problems, an object of the 
present invention is to provide an image processing device 
capable of outputting speech data together with image data in 
15 a reliably recognizable manner, and a printer having the 
same . 

In order to achieve the above and other objects and 
advantages of this invention, an image processing device for 
processing image data includes a speech data input unit for 
20 inputting speech data associated with the image data for 
representing speech. A voice tone convertor subjects the 
speech data to tone conversion. A speech data output unit 
outputs the tone-converted speech data in association with 
the image data . 

25 In a preferred embodiment, the voice tone convertor 

stores plural sets of tone mode information. Furthermore, a 
voice tone selector selects one of the plural sets of the 
tone mode information, wherein the voice tone convertor 
subjects the speech data to tone conversion according to the 

30 selected tone mode information. 
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In another preferred embodiment, the voice tone 
convertor generates conversion data for tone conversion 
control, and sends the conversion data to the speech data 
output unit. The tone -converted speech data is constituted 
5 by the speech data before being converted and the conversion 
data. 

Furthermore, a text data output unit outputs text data 
in association with the image data, the text data 
representing text . 
10 Furthermore, a speech/ text convertor converts the speech 

data from the speech data input unit into the text data. 

Furthermore, a text data input unit is externally 
operable, for inputting the text data. 

The text data includes at least one of a letter, a 
15 number and a sign. 

Furthermore, a text data input unit inputs text data 
associated with the image data for representing text. A 
text/speech convertor converts the text data into speech 
data, and sends the speech data to the speech data output 
20 unit. 

In a preferred embodiment, an image processing device 
for processing image data includes a data input unit for 
inputting first data associated with the image data. A 
convertor converts the first data into second data. A data 
25 output unit outputs the second data in association with the 
image data. 

The first data is text data for representing text, and 
the second data is speech data for representing speech. 

The data output unit further outputs the text data in 
30 association with the image data. 

Furthermore, the first data is speech data for 
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representing speech, the second data is text data for 
representing text . 

In another preferred embodiment, an image processing 
device for processing image data includes a text data input 
5 unit for inputting text data associated with the image data 
for representing text . A data output unit outputs the text 
data and conversion data in association with the image data, 
the conversion data being adapted to production of speech 
according to the text data. 

10 In a preferred embodiment, a printer for printing an 

image to recording material according to image data includes 
a speech data input unit for inputting speech data associated 
with the image data for representing speech. A voice tone 
convertor subjects the speech data to tone conversion. A 

15 speech data recorder records the tone-converted speech data 
to the recording material in association with the image. 

BRIEF DESCRIPTION OF THE DRAWINGS 
The above objects and advantages of the present 
invention will become more apparent from the following 
20 detailed description when read in connection with the 
accompanying drawings, in which: 

Fig. 1 is a block diagram illustrating a digital 
printing system including an image processing device; 

Fig. 2 is an explanatory view in plan, illustrating a 
25 photographic print; 

Fig. 3 is a flow chart illustrating a process of 
modifying and recording speech data; 

Fig. 4 is a block diagram illustrating another preferred 
digital printing system capable of writing speech data bar 
30 code; 

4 



Fig. 5A is an explanatory view illustrating tone- 
converted speech data; 

Fig. 5B is an explanatory view illustrating a 
combination of conversion data and original speech data; 
5 Fig. 6 is a block diagram illustrating still another 

preferred digital printing system capable of writing text 
data. 

DETAILED DESCRIPTION OF THE PREFERRED 
EMBODIMENT ( S ) OF THE PRESENT INVENTION 

10 In Fig. 1, a digital printing system 10 of the invention 

is illustrated. The digital printing system 10 is 

constituted by a line CCD scanner 11, an image processing 
unit 12, a speech data convertor component 13, a laser 
printer or video printer 14 as image forming unit, and a 

15 paper processor 15. An input machine 16 is a single machine 
including the line CCD scanner 11, the image processing unit 
12 and the speech data convertor component 13. An output 
machine 17 is a single machine including the laser printer 14 
and the paper processor 15. An image processing device 18 of 

20 the present invention consists of a combination of the image 
processing unit 12 and the speech data convertor component 
13. 

The line CCD scanner 11 is a device well-known to those 
skilled in the art, has a photo film holder or carrier for 
25 feeding the exposed photo film, and reads each of images from 
the photo film. The line CCD scanner 11 obtains scanned 
image data, and sends the same to the image processing unit 
12. 

The speech data convertor component 13 has a speech data 
30 input unit 20, a voice tone convertor 21, and a bar code data 
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output unit 22 as speech data output unit. The speech data 
input unit 20 is adapted to inputting speech data stored in 
a recording medium such as a memory card and a magnetic 
recording layer of the photo film at the time of or after an 
5 exposure in a camera. A memory 20a stores the speech data in 
a temporary manner. 

The speech data is read from the memory 20a, and sent to 
the voice tone convert or 21. The voice tone convertor 21 is 
constituted by a known circuit for converting the tone of 

10 voice constituting the speech. A voice tone selector 28 is 
connected to the voice tone convertor 21, and externally 
operated to designate one of modes of tone conversion. To 
convert the voice tone, the speech is separated into a pitch 
component and a spectral envelope component , which are 

15 subjected to processing known in the art. In the present 
embodiment, the voice tone convertor 21 is settable in first 
and second tone modes, and when in the first mode, converts 
the speech into a tone of a standard voice of a female adult, 
and when in the second mode, converts the speech into a tone 

20 of a standard voice of a male adult. Furthermore, voices of 
particular professional announcers , narrators , actors or the 
like may be preset for determining modes of converting the 
tone. Various known techniques of tone conversion can be 
used. For example, JP-A 9-244693 discloses tone conversion 

25 according to a model of speech data of a particular person. 
Furthermore, a personal computer may be used for tone 
conversion in combination of tone converting software stored 
therein. An amplifier 23 and a loud speaker 24 are connected 
with the voice tone convertor 21, and when desired, are used 

30 to play back the speech before or after conversion of the 
tone. It is to be noted that, in the present embodiment, the 
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first and second tone modes are determined for two tones of 
speech. However, two or more tone modes may be determined 
for selection of one of tone converting techniques different 
from one another. 
5 The bar code data output unit 22 converts the tone- 

converted speech data into a bar code data, which is sent to 
the image processing unit 12. 

The image processing unit 12 determines one of image 
frames to be printed according to the scanned image data, and 

10 subjects the scanned image data to image processing known in 
the art. At the time of image processing, the image 
processing unit 12 produces a bar code image according to bar 
code data from the bar code data output unit 22, and effects 
such image synthesis as to insert the bar code image into a 

15 portion under the image frame to be printed. Furthermore, 
the synthesized image is converted into recording data for 
the digital printing, and sends the same to the laser printer 
14. 

The laser printer 14 includes laser light sources of 
20 Red, Green and Blue colors and a modulator unit. According 
to recording image data, the laser light is modulated, and 
then applied to photographic paper. An image is recorded to 
the photographic paper in a form of a latent image by 
scanning and exposure. The paper processor 15 subjects the 
25 exposed photographic paper to various processes including 
color development, bleaching/fixation, water washing, and 
drying. Thus, an image is created in the photographic 
paper . 

In Fig. 2, a print 30 is depicted, which is obtained by 
30 the laser printer 14 from photographic paper as 
photosensitive recording material. An image 31 is printed in 
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the print 30, in which a speech data bar code 32 is also 
recorded. A bar code reader or bar code reading pen is used 
to read the speech data bar code 32, to play back the speech 
data. The speech data, after the tone conversion by the 
5 voice tone convertor 21, is data of speech of easily 
recognizable tone. Note that, instead of using a bar code 
reader specified to the tone conversion of the embodiment, it 
is possible to use a general -purpose scanner, and to read the 
bar code for the purpose of playing back the speech. 

10 In Fig. 3, a flow of printing according to the 

embodiment is depicted. At first, the line CCD scanner 11 
reads each of image frames in the photo film. The image 
frames being read are displayed in a display panel of the 
image processing unit 12 one after another. Also, a memory 

15 card or other recording medium is set in the speech data 
input unit 20, to read the speech data of a selected image 
frame. At the time of accepting an order, there are various 
kinds of information written on a surface of a photograph 
bag, including existence or lack of speech data, and 

20 existence or lack of tone conversion in addition to a 
printing size, a selected one of glossy finish or silk 
surface finish. According to the information, an operator 
operates the system to effect the tone conversion for the 
selected image frame with which speech data exists and 

25 conversion is desired. The tone-converted speech data is 
played back. If the tone-converted speech is regarded as 
acceptable by the operator, then the speech data bar code 32 
after the conversion is recorded with the image 31. The 
print 30 in Fig. 2 is produced. Note that, in the present 

30 embodiment, the operator checks the tone-converted speech. 
However, it is possible to record the tone-converted speech 



data automatically without being checked by an operator. 

In the above embodiment , the speech data is recorded in 
the print in the bar code form. Also, forms in which the 
speech data is recorded may be other than the bar code form. 
5 Furthermore, Fig. 4 illustrates an embodiment in which speech 
data is magnetically recorded. A digital printing system 41 
is provided with a magnetic speech data recorder 40. An 
input machine 43 includes a magnetic speech data output unit 
42, which receives the tone-converted speech data from the 

10 voice tone convertor 21, and converts the same into magnetic 
speech data. A magnetic head 44 in the magnetic speech data 
recorder 40 writes the magnetic speech data 46 to a magnetic 
recording region of a print 4 5 obtained from photographic 
paper as photosensitive recording material. An image 47 is 

15 so disposed in the print 45 that the magnetic recording 
region is lower than the image 47. Alternatively, a magnetic 
recording layer may be overlaid on the entirety or part of a 
rear surface of the print 45. The magnetic speech data 46 
can be recorded magnetically to the rear surface of the print 

20 45. So the image 47 can be disposed in a frame region 
without any margin in a recording surface of the print 45. 
In Fig. 4, elements similar to those in Fig. 1 are designated 
with identical reference numerals . 

Furthermore, it is possible to record the text data 

25 magnetically to the print 45 in a suitable form. 

Also, it is possible to use a recording sheet in which 
an IC memory is incorporated. Speech data may be recorded to 
the IC memory. Connection to the IC memory may be a contact 
type in which a pin is used, and also may be a non-contact 

30 type in which electromagnetic induction is used. If the 
memory has a sufficiently large capacity, it is possible to 
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write both the tone -converted speech data and the original 
speech data. 

In Figs . 5A and 5B , types of speech data to be recorded 
are depicted. In Fig. 5A, tone-converted speech data 33 is 
5 depicted, which is obtained by subjecting the original speech 
data to the conversion of the tone. In Fig. 5B, 

speech/conversion data set 34 is constituted by conversion 
data and original speech data. The tone -converted speech 
data 33 and the speech/conversion data set 34 are used to 

10 create the speech data bar code 32 depicted in Fig. 2. 
Furthermore, the tone -converted speech data 33 and the 
speech/conversion data set 34 are magnetically recorded as 
the magnetic speech data 46 in the magnetic recording region 
of Fig. 4. In Fig. 5B, the speech/conversion data set 34 

15 includes the conversion data, which is a combination of 
designation data for a tone converting control type and 
various parameters. In a player device, tone-converted 
speech data is created according to the original speech data 
and the conversion data, and plays back the speech in the 

20 converted tone. 

Note that , instead of combining a mode designating data 
with parameters to constitute the conversion data, it is 
possible that conversion data is constituted by designation 
data for designating one of voices of a male adult and a 

25 female adult. In this case, a player device outputs speech 
in the tone of male adult by converting the original speech 
data and the designation data for the voice of a male 
adult . 

In Fig. 6, another preferred embodiment is depicted. A 
30 digital printing system 53 includes a text data input unit 
50, a text/speech convertor 51 and a speech/text convertor 
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52. Elements similar to those in Fig. 1 are designated with 
identical reference numerals. 

An input machine 54 includes a data convertor device 55 
in combination with the line CCD scanner 11 and the image 
5 processing unit 12. The data convertor device 55 is 
constructed by the speech data input unit 20, the voice tone 
convertor 21, the bar code data output unit 22, the text data 
input unit 50, the text/speech convertor 51, the speech/text 
convertor 52 and the like. The text data input unit 50 

10 includes a keyboard, mouth and other elements for direct 
inputting of text data. In addition, the text data input 
unit 50 is provided with a reader for reading information 
from a flexible disk, MO or other recording medium, and a 
communication device in connection with the Internet or other 

15 network. The reader and the communication device are used 
for inputting the text data. 

The text data is sent to the text /speech convertor 51 
and also to an output machine 56. A back-printing type of 
text data recorder 57 is incorporated in the laser printer 14 

20 in the output machine 56, and prints letters and the like to 
a back surface of the photographic paper according to the 
text data. An example of the text data recorder 57 is a dot 
impact printer, in which ink is used and has a characteristic 
free from being influence by development in the paper 

25 processor 15. Note that it is also possible to use printers 
of other types free from being influenced by developing 
solution or the like, for example an ink jet printer. 

The text data recorder 57 also may be a printing device 
for use with ink, toner, dye or the like, for example an 

30 electrophotographic device, plotter or the like. 

The text/speech convertor 51 produces speech data from 
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the text data according to a known speech synthesizing 
technique. The text /speech convertor 51 stores plural sets 
of conversion control data which causes conversion to voices 
of a male adult, female adult, male juvenile, and female 
5 juvenile. The speech data is sent to the bar code data 
output unit 22, and converted into bar code data. The image 
processing unit 12 converts the bar code data into a bar 
code . 

Speech data input to the speech data input unit 20 is 
10 sent to the voice tone convertor 21, and also to the 
speech/text convertor 52 in a selective manner. The 
speech/text convertor 52 converts the speech data to text 
data in a conversion technique known to those skilled in the 
art. The text data is sent to the text data recorder 57, 
15 which prints a code, letters or signs to the back surface of 
the photographic paper. 

Note that it is also possible that the text data 
recorder 57 prints speech data in a form of a code, numbers, 
letters or signs to the back surface of the photographic 
20 paper. 

In the present embodiment, the printer has an auxiliary 
construction for recording the text to a recording surface of 
the print in addition to the printing of the text data 
recorder 57. If the text is desired to be recorded to the 

25 recording surface, the text data from the text data input 
unit 50 or the speech/text convertor 52 is sent to the image 
processing unit 12. The image processing unit 12 effects 
synthesis to combine the text image with the frame image. In 
the image synthesis , it is possible to record the text 

30 directly beside a speech bar code instead of inserting the 
text in the image frame. 
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In the above embodiment, the speech is separated into a 
pitch component and a spectral envelope component, which are 
subjected to processing to convert the voice tone. 
Furthermore, conversion may be effected according to the 
5 construction of Fig. 6. Speech data input to the speech data 
input unit 20 can be converted by the speech/ text convert or 
52 into text data, which can be converted by the text/speech 
convertor 51 into speech of a standard tone. 

In the above embodiment, a combination of original 

10 speech data and conversion data is recorded. Also, the tone- 
converted speech data is recorded. In addition, it is 
possible to record text data in a suitable recording form, 
for example in a bar code form. With this construction, a 
player device receives the text data, and plays back the 

15 speech by conversion of the text data into speech, or also by 
tone conversion. 

In the above embodiment , image frames in the photo film 
are read by the scanner. However, image data may be 
retrieved in various manners other than reading of the photo 

20 film. For example, image data and speech data may be 
obtained by use of a digital camera. Image data may be 
obtained by a scanning operation for an original sheet of any 
kind. Image data may be obtained from the Internet or a CD- 
ROM. 

25 A user must bring his or her image data or speech data 

to a photo finishing agent. To this end, it is possible to 
use the Internet or other network, or communication cable of 
telephone, cellular telephone or the like to transmit the 
image data or speech data. 

30 In the above embodiments, speech data is input by a 

camera. However, speech data may be input at the time 
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different from that according to the above embodiments, or 
may be input in a different manner from the above . 
Specifically, speech data may be input from a sound recorder, 
telephone, cellular telephone and the like. Speech data may 
5 be converted into digital data by a personal computer or the 
like. Also, speech data may be received by transmission of 
such digital data through the Internet. Also, the input 
machine may be provided with a section for acceptance of an 
order. At the time of acceptance, speech data associated 

10 with image data is input. The input machine may have a 
display panel, which can cause a user or customer to check 
and confirm a result of the tone conversion. 

In the above embodiment , the print is produced by the 
digital printing system. Furthermore, it is possible to use 

15 the construction of the invention in an analog type of 
printing system in which a light source applies printing 
light to the photo film, and a printing lens focuses the 
printing light on photosensitive material for exposure. In 
such a construction, a bar code exposure station is installed 

20 in addition to a frame exposure station, for printing a 
speech data bar code. In addition to the bar code recording, 
the speech data may be recorded magnetically or in other 
suitable manners . 

It is also to be noted that a code of a certain form may 

25 be predetermined in a manner and may represent both of speech 
data and text data, so as to make it possible to read a 
desired one of text and speech. The code may be recorded 
optically, magnetically, or with ink, toner or dye. Such a 
code may be constituted by a bar code, a train of numbers, 

30 letters or signs, and the like. 

In the above embodiment, the printing system is 
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constituted by the combination of the input machine 16, 54 
and the output machine 17, 56. However, the input machine 
16, 54 and the output machine 17, 56 may be unified in a 
single printer. Also, a device according to the invention 
5 may be an image processing device having the input machine 
16, 54 without the output machine 17, 56. Furthermore, a 
device according to the invention may be an image processing 
device having the input machine 16, 54 without the line CCD 
scanner 11. In the above embodiment, the laser printer 14 is 
10 included in the output machine 17, 56. However, any suitable 
type of an image forming device may be used for printing, for 
example a thermal printer, an ink- jet printer, and the 
like. 

Although the present invention has been fully described 
15 by way of the preferred embodiments thereof with reference to 
the accompanying drawings, various changes and modifications 
will be apparent to those having skill in this field. 
Therefore, unless otherwise these changes and modifications 
depart from the scope of the present invention, they should 
20 be construed as included therein. 
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